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Inhibition of Major Groove DNA Binding Proteins Bv Modified Polvamides 

The U.S. Government has certain rights to this invention pursuant to Grant Nos. 
GM 26453, 27681, and 47530 awarded by the National Institute of Health. 

5 CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part of PCT/US97/03332 filed February 20, 
1997, Serial No. 08/853,522 filed May 8, 1997 and PCT/US97/12722 filed July 21, 1997 
which are continuation-in-part applications of Serial No. 08/837,524 filed April 21, 1997 
10 and Serial No. 08/607,078 filed February 26, 1996; provisional application 60/042,022, 
filed April 16, 1997; provisional application 60/043,444 filed April 8, 1997; 
PCT/US98/[MBHB 97,853] filed January 21, 1998; PCT/US98/[MBHB 97,854] filed 
January 29, 1998; and, PCT/US98/[MBHB98,016], filed January 29, 1998. The 
specifications of these applications are incorporated herein by reference. 



15 



BACKGROUND OF THE INVENTION 



Field of the Invention 

This invention relates to polyamides which bind to pre-determined sites of the 
20 minor groove of double-stranded DNA and have an a-amino acid domain ("positive 
patch") capable of inhibiting the activity of major groove DNA-binding proteins. 

Background of the Invention 

Polyamides containing N-methylpyrrole (Py) and N-methylimidazole (Im) amino 
25 acids bind to predetermined sequences in the minor groove of DNA with affinities and 

specificities comparable to naturally occurring DNA binding proteins (Trauger, et al. 

(1996) Nature 382, 559-561; Swalley, et al. (1997) J. Am. Chem. Soc. 119, 6953-6961; 

Turner, et al. (1997) J. Am. Chem. Soc. 119, 7636-7644). Sequence specificity is 

determined by a code of oriented side-by-side pairings of the Py and Im amino (Wade, et 
30 al. (1992) J. Am. Chem. Soc. 114, 8783-8794; Mrksich, et al. (1992) Proc. Natl Acad. 

Set U.S.A. 89, 7586-7590; Wade, et al. (1993) Biochemistry 32, 11385-11389; Mrksich, 
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et al. (1993) J. Am. Chem. Soc. 115, 2572-2576; White, et al. (1997) Chem. Biol. 4, 569- 
578; White, et al. (1997) J. Am. Chem. Soc. 1 19, 8756-8765). An Im/Py pairing targets a 
G«C base pair, while Py/Im pair recognizes CG. The Py/Py pair is degenerate and 
targets both A«T and T»A base pairs (Pelton, et al. (1989) Proc. Natl. Acad. Sci. USA 86, 
5723-5727; Chen, et al. (1994) Nature Struct. Biol. 1, 169-175; White, et al. (1996) 
Biochemistry 35, 12532-12537). The validity of the pairing rules for ligand design is 
supported by a variety of polyamide structural motifs which have been characterized by 
footprinting, affinity cleaving, 2-D NMR, and x-ray methods. The Py/Py pair is 
degenerate and targets both A»T and T*A base pairs. Polyamides have been found to be 
cell permeable and to inhibit transcription factor binding and expression of a designated 
gene (Gottesfeld, et al. (1997) Nature 387, 202-205; Nealy, et al. (1997) J. Mol. Biol, in 
press). Py/Im polyamides offer a potentially general approach for gene regulation, 
provided that efficient inhibition of DNA-binding can be achieved for a variety of 
transcription factors. 

Several approaches for the development of synthetic ligands which interfere with 
protein-DNA recognition have been reported. Oligodeoxyribonucleotides which 
recognize the major groove of double-helical DNA via triple-helix formation bind to a 
broad range of sequences with high affinity and specificity (Moser, et al. (1987) Science 
238, 645-650; Thuong, et al. (1993) Angew. Chem. Int. Ed. Engl. 32, 666-690). Although 
oligonucleotides and their analogs have been shown to disrupt protein-DNA binding 
(Maher, et al. (1992) Biochemistry 31, 70-81; Duval-Valentin, et al. (1992) Proc. Natl. 
Acad. Sci. U.S.A. 89, 504-508; Nielsen, P. E. (1997) Chem. Eur. J. 3, 505-508), the triple- 
helix approach is limited to purine tracts and suffers from poor cellular uptake. There are 
a few examples of carbohydrate-based ligands which interfere with protein-DNA 
recognition, but oligosaccharides cannot currently recognize a broad range of DNA 
sequences (Ho, et al. (1994) Proc. Natl. Acad. Sci. USA 91, 9203-9207; Liu, et al. (1996) 
Proc. Natl. Acad. Sci. USA 93, 940-944). Analogs of distamycin (PyPyPy) appended 
with multiple cationic substituents have been found to inhibit protein binding. Rational 
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design of tripyrrole peptides that complex with DNA by both selective minor-groove 
binding and electrostatic interaction with the phosphate backbone. (Bruice (1992) Proc. 
Natl Acad. Sci. USA 89, 1700-1704; Chiang, et al. (1997) Proc. Natl. Acad. ScL USA 94, 
2811-2816; Bruice, et al. (1997) Bioorg. Med. Chem. 5, 685-692). Based on these 
5 encouraging results, we wished to identify similar charged residues which could be 
appended to a Py/Im polyamide via linear solid phase synthesis and would not 
compromise polyamide binding specificity. 

Proteins use a diverse structural library to recognize their target sequences (Steitz, 
T. A. (1990) Quart. Rev. Biophys. 23, 205-280). Proteins such as TBP bind exclusively 

10 in the minor groove (Kim, et al. (1993) Nature 365, 512-520), others, such as GCN4 
Oakley, M. G. & Dervan, P. B. (1990) Structural motif of the GCN4 DNA binding 
domain characterized by affinity cleaving (Oakley, et al. (1990) Science 248, 847-850; 
Ellenberger, et al. (1992) Cell 71, 1223-1237; Konig, et al. (1993) J. Mol Biol. 233, 139- 
154), bind exclusively in the major groove, and certain proteins such as Hin recombinase 

15 recognize both grooves (Sluka, et al. (1990) Biochemistry 29, 6551-6561; Feng, et al. 
(1994) Science 263, 348-355). Polyamides have been found to interfere with protein- 
DNA recognition in cases where contacts in the minor groove are important for protein- 
DNA binding affinity. For example, within the nine zinc-finger protein TFIIIA, fingers 4 
and 6 bind in or across the minor groove and are required for high affinity binding (AT a = 

20 5 x 10 9 M" 1 ). An eight-ring hairpin polyamide (AT a — 3 x 10 10 M" 1 ) targeted to the 
" minor groove contact region of finger 4 has been recently found to efficiently inhibit 
protein binding. 

X-Ray crystallography studies reveal that DNA bound by a 4-ring homodimeric 
polyamide is unaltered from its natural B-form structure, with all polyamide/DNA 
25 contacts confined to the minor groove (Kielkopf, et al. Nature Struct. Biol, in press). 
Polyamides have been shown to bind simultaneously with ligands that exclusively 
occupy the major groove (Oakley, et al. (1992). Biochemistry 31 , 10969-10975; Park, et 
al. (1992) Proc. Natl Acad. Sci. USA 89, 6653-6657). For example, an 8-ring hairpin 
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polyamide and a recombinant protein containing only the three amino-terminal zinc 
fingers of TFIIIA which are in the major groove were found to co-occupy the TFIIIA 
binding site. Similarly, the three-ring homodimer ImPyPy bound simultaneously with the 
bZIP protein GCN4 (226-281). 

Intrinsic DNA curvature and protein induced DNA bending are also involved in 
the regulation of gene transcription, replication initiation, and other processes (Perez- 
Martin, J., et al. (1994) Microbiological Reviews 58, 268-290; Polaczek, et al. (1997), 
submitted). DNA is an inherently flexible polymer and neutral backbone analogs of 
DNA curve, where rigidity is maintained in natural DNA by coulombic repulsion 
between phosphates on the same strand Strauss, et al. (1994) Science 266, 1829-1834; 
Manning, G. S. (1983) Biopolymers 22, 689-729). Sequence-dependent curvature of 
DNA is caused both by differential solvation in the minor groove and differential base 
stacking leading to alteration of roll and tilt values (Dlakic, et a. (1996) J. Biological 
Chemistry 271, 17911-17919; Bolshoy, et al. (1991) Proc. Natl. Acad. Sci., USA 88, 
2312-2316). 

Proteins and other ligands that bend DNA alter the stacking of the bases by 
intercalation of hydrophobic groups, alter the effective Debye length of the surface 
through charge screening, or bend through energetic compensation for tight binding 
events. An example of a protein that seems to work through all three mechanisms in 
bending DNA >160 degrees is integration host factor (IHF) (Rice, et al. (1996) Cell 
87,1295-1306). Previously, it has been shown that artificial sequence specific DNA 
bending ligands can be designed that utilize bidentate tight binding third strand 
oligonucleotides to constrict the intervening duplex and bend DNA (Liberies, et al. 
(1996) Proc. Natl. Acad. Sci., USA 93, 9510-4; Akiyama, et al. (1996) Proc. Natl. 
Acad. Sci., USA 93, 1212212127; Akiyama, et al. (1996) J. Biological Chemistry 271, 
29126-29135; Akiyama, et al. (1997) Biochemistry 36, 2307-2315). 
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Compounds that bind in the minor groove such as distamycin and DAPI have 
been shown to alter DNA rigidity (Larsson, et al. (1996) 1 Physical Chemistry 100, 
3252-3263; McCarthy, et al. (1991) Nucleic Acids Research 19, 3421-9; Barcelo, et al. 
(1991) Biochemistry 30, 4863-73.). While such compounds form few specific contacts 

5 and binding is dominated by the positive charge, polyamide analogs of distamycin have 
been designed that form specific high affinity structures with DNA in the minor groove. 
In such compounds, sequence specificity is determined by the sequence of side-by-side 
amino acid pairings, where imidazole (Im) opposite pyrrole (Py) recognizes a GC base 
pair, Py-Im recognizes CG, Py-Py is degenerate for AT or TA, while Im-Im pairing is 

10 disfavored (Wade, et al. (1992) J. Am. Chem. Soc. 114, 87838794; Mrksich, et al (1992) 
Proc. Natl. Acad ScL, USA 89,7586-7590; Wade, et al. (1993) Biochemistry 32, 
1 13851 1389; Pelton, et al. (1989) Proc. Natl. Acad. ScL, USA 86, 57235727; Pelton, et 
al. (1990) 7. Am. Chem. Soc. 112,1393-1399). This recognition motif generality has 
been demonstrated for a large number of sequences and is directly supported by NMR 

15 data (Mrksich, et al. (1993) J. Am. Chem. Soc. 115, 2572-2576; Geierstanger, et al. 
(1994) Biochemistry 33, 3055-3062; Geierstanger, et al. (1993) J. Am. Chem. Soc. 
115,4474-4482; Geierstanger, et al. (1994) Science 266, 646-650; Mrksich, et al. (1995) 
J.Am. Chem. Soc. 1 17,3325-3332; Mrksich, et al. (1993) J. Am. Chem. Soc. 115, 9892- 
9899; Dwyer, et al. (1993) J. Am. Chem. Soc. 1 15, 9900-9906; Mrksich, et al. (1994) J. 

20 Am. Chem. Soc. 116, 3663-3664; Mrksich, et at. (1994) J. Am. Chem. Soc. 116, 
79837988; Chen, et al. (1994) J. Am. Chem. Soc. 116, 6995-7005; Cho, et al. (1995) 
Proc. Natl. Acad. Sci., USA 92, 10389-10392). 

BRIEF DESCRIPTION OF THE FIGURES 

25 FIGURE 1 . Schematic models of polyamides targeted to GCN4 binding site. 
FIGURE 2. Schematic models of Arg-Pro-Arg polyamides. 
FIGURE 3. Structure of DNA binding domain of Hin recombinase. 
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FIGURE 4. Eight-ring hairpin polyamides. 
FIGURE 5. Synthesis of Arg-Pro-Arg polyamides. 
FIGURE 6. GCN4 gel mobility shift experiments. 
FIGURE 7. Arg-Pro-Arg- Arg-Arg-Arg polyamides. 
FIGURE 8. GCN4 gel mobility shift experiments. 
FIGURE 9. Aliphatic amino acid substitutions in positive patch domain. 
FIGURE 10. Structure of ImPyPyPy-y-PyPyPyP y -C7-RPR. 
FIGURE 1 1 . Quantitative Dnase footprint experiments. 
FIGURE 12. Structural requirements for GCN4 binding. 
FIGURE 13. Binding domains and chemical structures of PA1 and PA2. 
FIGURE 1 4. Gel shift analysis of oligonucleotides 2-6, 9 on a 3' 32 P end-labeled 
restriction fragment. 

FIGURE 15. Gel shift analysis of oligonucleotides 2 (A) and 9 (B) on a 3' 32 P end- 
labeled restriction fragment. 

FIGURE 16. Gel shift analysis of oligonucleotide 2 on a 3' 32 P end-labeled restriction 
fragment generated with EcoRl and Hindlll. 

FIGURE 1 7. Gel shift analysis of oligonucleotide 2 on a 3' 32 P end-labeled restriction 
fragment generated with EcoRl and HindHL. 

FIGURE 1 8. MPE and DNase I footprinting analysis of oligonucleotide2 on a 3' 32 P 
end-labeled restriction fragment generated with EcoRl and PvwII. 

SUMMARY OF THE INVENTION 

This invention provides improved polyamides comprising a positive patch that, 
upon binding of the polyamide to the minor groove of a DNA molecule, is able to contact 
nucleotides in the major groove of a DNA molecule. The positive patch may comprise 
any chemical moiety that delivers a charge to the DNA molecule. The invention further 
comprises polyamides comprising having the ability to alter the conformation of a DNA 
molecule such that the function of a conformation-dependent DNA binding protein is 
inhibited. As such, the polyamides inhibit gene expression by binding the minor groove 
DNA sequence and displacing or preventing the binding or function of DNA-binding 
proteins such as transcription factors. 
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DETAILED DESCRIPTION 

Within this application, unless otherwise stated, definitions of the terms and 
illustration of the techniques of this application may be found in any of several well- 

5 known references such as: Sambrook, J., et al, Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press (1989); Goeddel, D., ed., Gene Expression 
Technology, Methods in Enzymology, Vol. 185, Academic Press, San Diego, CA (1991); 
"Guide to Protein Purification" in Deutshcer, M.P., ed., Methods in Enzymology, 
Academic Press, San Diego, CA (1989); Innis, et al., PCR Protocols: A Guide to Methods 

10 and Applications, Academic Press, San Diego, CA (1990); Freshney, R.I., Culture of 
Animal Cells: A Manual of Basic Technique, 2 nd Ed., Alan Liss, Inc. New York, NY 
(1987); Murray, E.J., ed. t Gene Transfer and Expression Protocols, pp. 109-128, The 
Humana Press Inc., Clifton, NJ and Lewin, B., Genes VI Oxford University Press, New 
York (1997). 

15 For the purposes of this application, a promoter is a regulatory sequence of DNA 

that is involved in the binding of RNA polymerase to initiate transcription of a gene. A 
gene is a segment of DNA involved in producing a peptide, polypeptide or protein, 
. including the coding region, non-coding regions preceding ("leader") and following 
("trailer") the coding region, as well as intervening non-coding sequences ("introns") 

20 between individual coding segments ("exons"). Coding refers to the representation of 
amino acids, start and stop signals in a three base "triplet" code. Promoters are often 
upstream ("5' to") the transcription initiation site of the corresponding gene. Other 
regulatory sequences of DNA in addition to promoters are known, including sequences 
involved with the binding of transcription factors, including response elements that are 

25 the DNA sequences bound by inducible factors. Enhancers comprise yet another group of 
regulatory sequences of DNA that can increase the utilization of promoters, and can 
function in either orientation (5 '-3' or 3'-5') and in any location (upstream or 
downstream) relative to the promoter. Preferably, the regulatory sequence has a positive 
activity, i.e., binding of an endogeneous ligand (e.g. a transcription factor) to the 

30 regulatory sequence increases transcription, thereby resulting in increased expression of 
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10 



20 



25 



30 



the corresponding target gene. In such a case, interference with transcription by binding a 
polyamide to a regulatory sequence would reduce or abolish expression of a gene 

The promoter may also inc.ude or be adjacent to a regulatory sequence known in 
the art as a silence, A silencer sequence generally has a negative regulatory effect on 
expression of the gene. In such a case, expression of a gene may be increased directly by 
usmg a polyamide to prevent binding of . factor to a silencer regulatory sequence or 
mchrectly, by using a polyamide to block transcription of a factor to a silencer regulatory 
sequence. J 

It is to be understood that the polyamides of this invention bind to double stranded 
m 3 SeqU6nCe SpCCiflC « ^e function of a segment of DNA of a given 
sequence, such as 5'-TATAAA-3', depends on its position relative to other functional 
re gl ons in the DNA sequence. In this case, if the sequence 5'-TATAAA-3' on the sense 
strand of DNA is positioned about 30 base pairs upstream of the transcription start site 
the sequence forms part of the promoter region (Lewin, Genes VI, pp. 831-835) On the 
other hand, if the sequence 5'-TATAAA-3' is downstream of the transcription start site in 
a codmg region and in proper register with the reading frame, the sequence encodes the 
tyrosyl and lysyl amino acid residues (Lewin, Genes VI, pp. 213-215). 

While not being held to one hypothesis, it is believed that the binding of the 
polyamides of this invention modulate gene expression by altering the binding of DNA 
b.ndmg proteins, such as RNA polymerase, transcription factors, TBF, TFIIIB and other 
protems. The effect on gene expression of polyamide binding to a segment of double 
stranded DNA is believed to be related to the function, e.g., promoter, of that segment of 

It is to be understood by one skilled in the art that the improved polyamides of 
the present invention may bind to any of the above-described DNA sequences or any 
other sequence having a desired effect upon expression of a gene. In addition, U.S Patent 
No. 5,578,444 describes numerous promoter targeting sequences from which base pair 
sequences for targeting an improved polyamide of the present invention may be 
identified. 

K is generally understood by those skilled in the an tha, ,he basic structure of 
DNA ,n a hving ce U taolndes both najor and a min or ^e. For the purposes of 
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describing the present invention, the minor groove is the narrow groove of DNA and the 
major groove is the deep groove of DNA as illustrated in common molecular biology 
references such as Lewin, B., Genes VI, Oxford University Press, New York (1997). 

It is further understood by those skilled in the art that a DNA binding protein is a 
5 protein capable of making contact with a DNA molecule, generally in the major groove, 
through hydrogen bonds, ionic bonds and/or hydrophobic interactions as illlustrated in 
common molecular biology references such as Lewin, supra, or Alberts, et al., Eds., 
Molecular Biology of the Cell, 3 rd , Ed., Garland Publishing, Inc., New York, 1994. 
Preferably, a DNA-binding protein is one that affects gene expression following binding 

10 to a DNA molecule. 

Several basic motifs of DNA-binding proteins are known to those skilled in the 
art. One such conformation is the helix-turn-helix motif, which includes a specific 
subclass known as homeodomain proteins. Other common motifs include the zinc finger 
motif, the leucine zipper motif, and the helix-loop-helix motif. A DNA-binding protein 

15 of this application may maintain any of the above-described motifs or any other motif 
that provides a protein with the ability to bind DNA and affect gene expression. 

To affect gene expression in a cell, which may include causing an increase or a 
decrease in gene expression, a effective quantity of one or more polyamide is contacted 
with the cell and internalized by the cell. The cell may be contacted in vivo or in vitro. 

20 Effective extracellular concentrations of polyamides that can modulate gene expression 
range from about 10 nanomolar to about 1 micromolar. Gottesfeld, J.M., et al, Nature 
387 202-205 (1997). To determine effective amounts and concentrations of polyamides in 
vitro, a suitable number of cells is plated on tissue culture plates and various quantities of 
one or more polyamide are added to separate wells. Gene expression following exposure 

25 to a polyamide can be monitored in the cells or medium by detecting the amount of the 
protein gene product present as determined by various techniques utilizing specific 
antibodies, including ELISA and western blot. Alternatively, gene expression following 
exposure to a polyamide can be monitored by detecting the amount of messenger RNA 
present as determined by various techniques, including northern blot and RT-PCR. 

30 Similarly, to determine effective amounts and concentrations of polyamides for 

in vivo administration, a sample of body tissue or fluid, such as plasma, blood, urine, 
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cerebrospinal fluid, saliva, or biopsy of skin, muscle, liver, brain or other appropriate 
tissue source is analyzed. Gene expression following exposure to a polyamide can be 
monitored by detecting the amount of the protein gene product present as determined by 
various techniques utilizing specific antibodies, including ELISA and western blot. 
5 Alternatively, gene expression following exposure to a polyamide can be monitored by 
detecting the amount of messenger RNA present as determined by various techniques, 
including northern blot and RT-PCR. 

The polyamides of this invention may be formulated into diagnostic and 
therapeutic compositions for in vivo or in vitro use. Representative methods of 

10 formulation may be found in Remington: The Science and Practice of Pharmacy y 19th 
ed., Mack Publishing Co., Easton, PA (1995). 

For in vivo use, the polyamides may be incorporated into a physiologically 
acceptable pharmaceutical composition that is administered to a patient in need of 
treatment or an animal for medical or research purposes. The polyamide composition 

15 comprises pharmaceutically acceptable carriers, excipients, adjuvants, stabilizers, and 
vehicles. The composition may be in solid, liquid, gel, or aerosol form. The polyamide 
composition of the present invention may be administered in various dosage forms orally, 
parentally, by inhalation spray, rectally, or topically. The term parenteral as used herein 
includes, subcutaneous, intravenous, intramuscular, intrasternal, infusion techniques or 

20 intraperitoneally. 

The selection of the precise concentration, composition, and delivery regimen is 
influenced by, inter alia, the specific pharmacological properties of the particular selected 
compound, the intended use, the nature and severity of the condition being treated or 
diagnosed, the age, weight, gender, physical condition and mental acuity of the intended 

25 recipient as well as the route of administration. Such considerations are within the 
purview of the skilled artisan. Thus, the dosage regimen may vary widely, but can be 
determined routinely using standard methods. 

Polyamides of the present invention are also useful for detecting the presence of 
double stranded DNA of a specific sequence for diagnostic or preparative purposes. The 

30 sample containing the double stranded DNA can be contacted by polyamide linked to a 
solid substrate, thereby isolating DNA comprising a desired sequence. Alternatively, 



WO 98/37087 



PCT/US98/02684 



11 

polyamides linked to a suitable detectable marker, such as biotin, a hapten, a radioisotope 
or a dye molecule, can be contacted by a sample containing double stranded DNA. 

The design of Afunctional sequence specific DNA binding molecules requires the 
integration of two separate entities: recognition and functional activity. Polyamides that 

5 specifically bind with subnanomolar affinity to the minor groove of a predetermined 
sequence of double stranded DNA are linked to a functional molecule, providing the 
corresponding Afunctional conjugates useful in molecular biology, genomic sequencing, 
and human medicine. Polyamides of this invention can be conjugated to a variety of 
functional molecules, which can be independently chosen from but is not limited to 

10 arylboronic acids, biotins, polyhistidines comprised from about 2 to 8 amino acids, 
haptens to which an antibody binds, solid phase supports, oligodeoxynucleotides, N- 
ethylnitrosourea, fluorescein, bromoacetamide, iodoacetamide, DL-a-lipoic acid, 
acridine, captothesin, pyrene, mitomycin, texas red, anthracene, anthrinilic acid, avidin, 
DAPI, isosulfan blue, malachite green, psoralen, ethyl red, 4-(psoraen-8-yloxy)-butyrate, 

15 tartaric acid, (+)-a-tocopheral, EDTA, methidium, acridine, Ni(II)-Gly-Gly-His, thiazole 
orange (TO), Dansyl, pyrene, N-bromoacetamide, and gold particles. Such bifunctional 
polyamides are useful for DNA affinity capture, covalent DNA modification, oxidative 
DNA cleavage, DNA photocleavage. Such bifunctional polyamides are useful for DNA 
detection by providing a polyamide linked to a detectable label. Detailed instructions for 

20 synthesis of such bifunctional polyamides can be found in copending U.S. provisional 
application 60/043,444, the teachings of which are incorporated by reference. 

DNA complexed to a labeled polyamide can then be determined using the 
appropriate detection system as is well known to one skilled in the art. For example, 
DNA associated with a polyamide linked to biotin can be detected by a streptavidin / 

25 alkaline phosphatase system. 

The present invention also describes a diagnostic system, preferably in kit form, 
for assaying for the presence of the double stranded DNA sequence bound by the 
polyamide of this invention in a body sample, such brain tissue, cell suspensions or tissue 
sections, or body fluid samples such as CSF, blood, plasma or serum, where it is desirable 

30 to detect the presence, and preferably the amount, of the double stranded DNA sequence 
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bound by the polyamide in the sample according to the diagnostic methods described 
herein. 

The diagnostic system includes, in an amount sufficient to perform at least one 
assay, a specific polyamide as a separately packaged reagent. Instructions for use of the 
packaged reagent(s) are also typically included. As used herein, the term "package" 
refers to a solid matrix or material such as glass, plastic (e.g., polyethylene, 
polypropylene or polycarbonate), paper, foil and the like capable of holding within fixed 
limits a polyamide of the present invention. Thus, for example, a package can be a glass 
vial used to contain milligram quantities of a contemplated polyamide or it can be a 
microtiter plate well to which microgram quantities of a contemplated polyamide have 
been operatively affixed, i.e., linked so as to be capable of being bound by the target 
DNA sequence. "Instructions for use" typically include a tangible expression describing 
the reagent concentration or at least one assay method parameter such as the relative 
amounts of reagent and sample to be admixed, maintenance time periods for reagent or 
sample admixtures, temperature, buffer conditions and the like. A diagnostic system of 
the present invention preferably also includes a detectable label and a detecting or 
indicating means capable of signaling the binding of the contemplated polyamide of the 
present invention to the target DNA sequence. As noted above, numerous detectable 
labels, such as biotin, and detecting or indicating means, such as enzyme-linked (direct or 
indirect) streptavidin, are well known in the art. 

Trauger,et al. (Nature, 382: 559-561) and Swalley, et al. (J. Am. Chem. Soc. 119: 
6953-6961) have described recognition of DNA by certain polyamides at subnanomolar 
concentrations. Pairing specific carboxyamide groups allows for recognition of specific 
DNA sequences (Swalley, et al. supra). Polyamides comprising Hp, Im, and Py provide 
for coded targeting of pre-determined DNA sequences with high affinity and specificity. 
Hp, Im, and Py polyamides may be combined to form Im/Py, Py/Im, Hp/Py, and Py/Hp 
binding pairs which complement the four Watson-Crick base pairs A, C, G, and T. Table 
1 illustrates such pairings. 
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TABLE 1 

Pairing Codes for Base Pair Recognition * 

Pair G*C ~~ OG ~~~ T*A A^T 

Im/Py 7 ~ + - 

Py/Im - + - - 

Hp/Py + 

Py/Hp + 



* favored (+), disfavored (-) 
5 Three-, four-, five- or six-ring improved polyamides of the present invention are 

covalently coupled to form six-, eight-, ten- or twelve-ring strucures, respectively, that 
bind specifically to four or six base pair targets, respectively, at subnanomolar 
concentrations. As such, the improved polyamides of the present invention may be 
directed to any DNA sequence comprised of A, C, G, or T. 

10 The instant invention provides polyamides having the ability to interfere with 

gene expression by altering the topology of a DNA molecule physically or by altering the 
chemical environment of the DNA molecule. By altering the topology of a DNA 
molecule, it is possible to inhibit the function of DNA-binding proteins that are 
dependent on DNA conformation for binding. The inclusion of a positive patch in a 

15 polyamide allows for alteration of the chemical environment surrounding the DNA 
molecule and serves to inhibit binding or function of DNA-binding proteins that bind the 
major groove of a DNA molecule. 

Transcription and replication of DNA is dependent upon intrinsic DNA curvature 
and protein-induced DNA bending. Previously, sequence-specific DNA bending ligands 

20 have been designed to bind two noncontiguous target sites in the major groove and 
induce a bend in the DNA (Liberies, D. A. & Dervan, P. B. (1996) Proc. Natl Acad, 
ScL, USA 93, 9510-4). This bend was shown to be dependent upon the linker length 
connecting the two sites. The present invention comprises sequence-specific polyamides 
targeted to the minor groove of the double helical linker region not overlapping either 

25 triple helical region, are capable of inhibiting bidentate third strand oligonucleotide 
binding. This inhibition through rigidification of the duplex is dependent upon the bend 
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angle of the DNA, but is independent of the order of addition of bending and 
straightening ligands. As such, polyamides may be useful for displacing DNA bending 
transcription factors in cells. 

It has been demonstrated that artificial sequence-specific DNA bending ligands 
5 can be designed that utilize bidentate tight binding third strand oligonucleotides to 
constrict the intervening duplex and bend DNA (Liberies, et al. (1996) Proc. Natl. Acad. 
ScL, USA 93, 9510-4; Akiyama, et al. (1996) Proc. Natl Acad. ScL, USA 93, 12122- 
12127; Akiyama, et al. (1996) J. Biological Chemistry 27l y 29126-29135; Akiyama, et al. 
(1997) Biochemistry 36, 2307-2315). Presented herein is a mechanism of action by which 

10 certain improved polyamides affect DNA conformation, thus inhibiting the function of 
DNA-binding proteins that are dependent upon the conformation of DNA for binding. 
While compounds such as DAPI form a few specific contacts and binding is dominated 
by the positive charge of the compound, polyamide analogs of distamycin have been 
designed that form specific high affinity structures with DNA in the minor groove. 

15 Provided herein are polyamides having the ability to displace a DNA bending by 
rigidification of a bent region not contacted by the ligand. A polyamide of the present 
invention may be utilized to "straighten", defined herein as altering the conformation of a 
DNA molecule such that a conformationally-dependent DNA-binding protein is unable to 
bind to the DNA or function properly, DNA molecules of widely varying sequence. 

20 The negatively charged DNA phosphate backbone provides a target for ligands 

designed to disrupt the unique microenvironment representing a protein binding site on 
the DNA double helix. The present invention comprises improved polyamides having a 
positive patch. Preferably, the positive patch contacts the phosphate backbone and 
disrupts the microenvironment, thereby preventing the binding and/or function of proteins 

25 that bind the DNA major groove. Polyamides that deliver a positive patch to the DNA 
backbone most likely destabilize the contacts between the protein side chains and the 
phosphate residues, and thereby inhibit protein binding (Bruice, et al. (1992). Proc. Natl. 
Acad. ScL USA 89, 1700-1704; Chiang, et al. (1997). Proc. Natl. Acad. ScL USA 94, 
281 1-2816; Bruice, et al. (1 997). Bioorg. Med. Chem. 5, 685-692) (Figure 1). 

30 The positive patch comprises any chemical moiety that is capable of delivering a 

charge to the chemical environment of the DNA molecule. The positive patch comprises 
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a charged group placed on the C-terminus, N-terminus, N-methyl group or other 
modifiable position of the polyamide. Preferably, the charged group comprises any 
amino acid having a net charge of at least +1. More preferably, the charged group is a 
primary, secondary, tertiary, quarternary amino group or a guanidinium or amidinium 
5 group. In one embodiment, the positive patch comprises an amino acid residue having a 
net positive charge joined to the C-terminus of a polyamide such that the polyamide has 
the ability to displace or prevent the binding of a DNA-binding protein to the major 
groove of a DNA molecule (Figure 12). 

It has been demonstrated that a variety of polyamide motifs co-occupy the DNA 

10 helix at sites both overlapping and adjacent to certain DNA-binding proteins, such as 
GCN4 (Figure 1). Table 2 shows schematic models of polyamides targeted to the binding 
site of the bZIP transcriptional activator, GCN4: ImPyPy-Dp, ImPyPy-y-ImPyPy-P-Dp, 
ImPyPyPy-y-PyPyPyPy-p-Dp, and ImPyPy-y-ImPyPy-p-PyPyPy-G-Dp. The nine-base 
pair (5'-CTGACTAAT-3') GCN4 binding site is indicated by brackets above and below 

15 the base pairs. Filled and unfilled circles represent imidazole (Im) and pyrrole (Py) 
polyamide rings, respectively. Diamonds and triangles represent p-alanine (P) and 
glycine (G), respectively. y-Aminobutyric acid (y) and dimethylaminopropylamide (Dp) 
are depicted as a curved line and a plus sign, respectively. P'olyamide binding sites are 
shown in bold. Equilibrium association constants (^ a ) for e ach polyamide binding to the 

20 indicated match site are shown at the right. Association constants were determined by 
DNase I footprinting; simultaneous binding was determined by gel mobility shift assay. 
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Table 2 

Polyamide Motifs That Bind Simultaneously with Major Groove DNA-Binding 

Proteins 



Polyamide Complex K (NT 1 ) 

— — _ — a 

5'-T G CIC T A C * A A TlA G T -3' 

imPyPy-Dp +> #OOC+ , x ^ 

3'~A C GiGACI OATTA. T C A -5* 



5'-T G C ICICAC!A A ' T lA G T -3' 
lmPyPy- r lmPyPy-fJ-Dp 1 X I0 7 

3'-A C G iG A C T G A T T A t T c A -5* 



5 ( -Gli! H A C U X Til G T T A T C-3 ' 
ImPyPyPy-y-PyPyPyPy-fJ-Dp +>00OOO ) 1 X 

3'-CiG ACTGATT&jA C A A T A G-5 * 



5'-T G CIC T G A C T A A TIA G T -3' 
ImPyPy-Y-lmPyPy-p-PyPyPy-G-Dp (^00<><>OCH><+ 1 x 10 10 

3 -A C G t G A C T O A T T A. T C A -5' 



The improved polyamides of the present invention have at least three consecutive 
carboxamide pairings for binding DNA in the minor groove of a regulatory sequence of a 
duplex gene sequence, and a positive patch sequence for interference with DNA-binding 
protein function. The improved polyamides may further comprise a chiral hairpin turn 
having a stereochemical center substituted at the y-position of the chiral hairpin turn of 
the molecule with the R-enantiomer of 2,4-diaminobutyric acid (H 2 NHCHCH 2 CHNH 2 - 
COOH; "(#) H2N y "). 

The present invention comprises improved polyamides having three or four-ring 
polyamide structures covalently coupled to form six-, eight-, ten- or twelve-ring hairpin 
structures, respectively, of the general structures I- VIII: 
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X 1X2X37X4X5X5 A X,X 2 X3X 4 yX s X 6 X 7 X 8 A X,X2X 3 X 4 X5yX 6 X 7 X 8 X 9 X 10 A 
I II III 



X|X 2 X 3 X 4 X5X 6 YX 7 X g X 9 X| 0 X 1 |X 12 A 
5 IV 



X,X 2 X 3 ^ X',X' 2 X^ X,X 2 X 3 X 4x X\X' 2 X',X\^ 

Y Y y y 

AX 6 X 5 xf N LX' 6 X 1 5 X 1 4 / X P AX 8 X 7 X 6 xf X LX\X' 7 X< 6 X'f X p 

V VI 



X) X 2 X 3 X 4 X 5 X', X' 2 X' 3 X 4 X' 5 

/\ y\ 

AX , 0 X 9 X 8 X 7 X 6 LX 10 X 9 X 8 X 7 X 6 p 
VII 



X, X 2 X 3 X 4 X 5 X 6 X\X' 2 X 3X4X5X5 ^ 

AX 12 X n X 10 X 9 X 8 X 7 LX 12 X f ! |X' ]0 X l 9 X , g X > 7 p 
VIII 

where X M2 and X\. 12 are independently an imidazole such as N-methylimidazole (Im), a 
pyrrole such as N-methylpyrrole (Py), or a hydroxypyrrole such as 3-hydroxy-N-methyl 

10 pyrrole (Hp). In addition, an improved polyamide of the present invention may further 
include a aliphatic amino acid such as p-alanine residue (fi), an amide group such as 
dimethylaminopropylamide (Dp), an alcohol such as EtOH, an acid such as EDTA, or 
any derivative thereof may be joined to the (} residue. 

"A" represents a positive patch which comprises any chemical moiety that is 

15 capable of delivering a positive charge to the major groove of a DNA molecule. 
Preferably, the positive patch comprises a rigid group joined to a positively charged 
group. The rigid group positions the positively charged group such that contact with the 
major groove or the phosphate backbone of a DNA molecule is accomplished. 

Preferably, the rigid group comprises one to ten amino acids. More preferably, 

20 the rigid group comprises one to eight amino acids. More preferably, the rigid group 
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comprises one to six amino acids. More preferably, the rigid group comprises one to four 
amino acids. Most preferably, the rigid group comprises two amino acids. Of the most 
preferred rigid group, the first amino acid is positioned adjacent to the polyamide and 
may comprise arginine, proline, lysine, hydroxyproline, the corresponding L or D 
enantiomers thereof or a derivative thereof. Preferably, the first amino acid is arginine or 
lysine and most preferably the first amino acid is arginine. The second amino acid is 
positioned at the carboxy end of the first amino acid. Suitable second amino acids 
comprise proline, glycine, serine, threonine, leucine, isoleucine, valine, alanine, 
hydroxyproline the corresponding L or D enantiomers thereof or a derivative thereof. 
Preferably, the second amino acid is proline or glycine and most preferably the second 
amino acid is proline. 

A suitable positively charged group comprises a synthetic or naturally occurring 
amino acid. Preferably, the positively charged group is a primary amino group, 
secondary amino group, tertiary amino group, quartenary amino group, guanidinium 
group or amidinium group. It is preferred that the positively charged group is an amino 
acid bearing a net charge of at least +1 . More preferably, the positively charged group is 
arginine, lysine, histidine or a derivative thereof. Most preferably, the positively charged 
group is arginine. 

The positive patch may be joined to the polyamide using an "attachment" group. 
Preferably, the attachment group comprises an amino acid. More preferably, the 
attachement moiety is P-alanine, y-aminobutyric acid, valeric acid, or any of the 
corresponding 2-amino derivatives of p-alanine, y-aminobutyric acid, or valeric acid. 
Most preferably, the attachment group is P-alanine. 

In addition, an improved polyamide of the present invention may further include a 
aliphatic amino acid such as p-alanine residue (P), an amide group such as 
dimethylaminopropylamide (Dp), an alcohol such as EtOH, an acid such as EDTA, or 
any derivative thereof may be joined to the p residue. 

p-alanine may also be utilized in place of a pyrrole or hydroxypyrrole amino acid 
in Formulas I- VIII. The use of P-alanine in place of a pyrrole or hydroxypyrrole amino 
acid in the synthetic methods provides aromatic/aliphatic pairing (Im/p, p/Im, Py/p, and 



WO 98/37087 



PCT/US98/02684 



19 

p/Py) and aliphatic/aliphatic pairing (p/p) substitution. Such substitutions may comprise 
those described in provisional application 60/042,022, incorporated herein by reference. 
The use of y-aminobutyric acid, or a substituted y-aminobutyric acid such as (R)-2,4 
diaminobutyric acid, provides for preferred hairpin turns. Inclusion of the positive patch 

5 allows the polyamide to deliver a positive residue to the DNA backbone and interfere 
with protein-phosphate contacts. As such, the positive patch contacts the major groove or 
the phosphate backbone of a DNA molecule and inhibits the binding or function of DNA- 
binding proteins. Many other groups suitable for the purposes of practicing this invention 
are well known and widely available to one skilled in the art. 

10 The polyamide subunit structures I- VIII above may be covalently coupled through 

the y residue which represents a -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin linkage derived from 
y-aminobutyric acid or a chiral hairpin linkage derived from R-2,4-diaminobutyric acid. 
The present invention provides the reagents and methodologies for substituting the y- 
residue of certain polyamides with a moiety such as (7?)-2,4,-diaminobutyric acid 

15 ((fl) H2N y). The NMR structure of a hairpin polyamide of sequence composition 
ImPyPy-y-PyPyPy complexed with a 5'-TGTTA-3' target site indicated that it was 
possible to substitute the a-position of the y-aminobutyric acid residue within the hairpin- 
DNA complex (de Claire, et al. J. Am. Chem. Soc. 1997, 779, 7909). Modeling indicated 
that replacing the a-H of y with an amino group that may confer an 7?-configuration at the 

20 a-carbon could be accommodated within the floor and walls of the minor groove as 
demonstrated in Figure 1 and 2A. In contrast, the ($-2,4,-diaminobutyric acid ((£)* * 2N y ) 
linked hairpin is predicted to clash with the walls of the minor groove of the DNA helix 
as illustrated in Figures 1 and 2B. 

In Formulas V-VIII, L represents an amino acid linking group such as P-alanine 

25 or 5-aminovaleric acid (8) bound to the y residue of a first polyamide and to the 
carboxytail of a second polyamide. As such, two or more polyamides may be linked, 
forming a tandemly-linked polyamide. Such a polyamide is said to be tandemly-linked or 
a tandem-linked polyamide. 
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P represents from zero to ten polyamides of formulas IX-XII: 

X,X 2 X 3 yX 4 X 5 X 6 X,X 2 X3X 4 yX 5 X 6 X 7 X 8 X I X 2 X s X 4 X s 7X 6 X 7 X g X 9 X 10 
IX X XI 

XjX 2 X 3 X 4 X 5 X 6 yX 7 X 8 X 9 X 1() X n X 12 
XII 

that may be tandemly linked to another polyamide. Preferably, P represents from zero to 
eight polyamides of formulas IX-XII. More preferably, P represents from zero to six 
polyamides of formulas IX-XII. More preferably, P represents from zero to four 
polyamides of formulas IX-XII. Most preferably, P represents from zero to two 
polyamides of formulas IX-XII. In Formulas IX-XII, X,-X 12 are as defined above. 
Tandem linking of polyamides provides expanded binding site size and increased binding 
affinity without compromising selectivity. Many other groups suitable for the purposes 
of practicing this invention are well known and widely available to one skilled in the art. 

Baird, et al. (J. Am. Chem. Soc. 118: 6141-6146) and PCT/US97/003332 describe 
methods for synthesis of polyamides which are suitable for preparing polyamides of this 
invention. Polyamides of the present invention may be synthesized by solid phase 
methods using compounds such as Boc-protected 3-methoxypyrrole, imidazole, and 
pyrrole aromatic amino acids, which are cleaved from the support by aminolysis, 
deprotected with sodium thiophenoxide, and purified by reverse-phase HPLC. The 
identity and purity of the polyamides may be verified using any of a variety of analytical 
techniques available to one skilled in the art such as 1H-NMR, analytical HPLC, and/or 
matrix-assisted laser-desorption ionization time-of-flight mass spectrometry (MALDI- 
TOF MS-monoisotropic). 

Described herein is the synthesis of a new class of polyamides and their 
characterization with regard to DNA binding affinity and sequence specificity. Revealed 
herein to one skilled in the art are properties of positive patch elements that may be 
utilized as a guide in the design of more efficient polyamides. The present invention 
provides one skilled in the art with the reagents and methodologies for the design, 
synthesis and utilization of polyamides comprising a positive patch. As a representative 
example, a series of polyamides with Arg-Pro-Arg tripepetides at the C-terminus are 
provided. It is demonstrated that such polyamides selectively inhibit DNA binding by the 
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major groove-binding transcription factor GCN4, as measured by gel mobility shift 
assays. Polyamides having certain residues of the positive patch substituted are provided 
to identify the function of each amino acid in inhibition of protein binding. Also 
provided are DNase I footprint titration experiments to measure the effect of net ligand 
5 charge on both the DNA binding affinity and specificity of the hairpin polyamide. As 
demonstrated herein, addition of an Arg-Pro-Arg tripeptide does not result in significant 
alteration of polyamide-DNA binding affinity or specificity. 

The instant invention provides polyamides that act as synthetic ligands to affect 
binding of proteins with affinity for the major groove of DNA, as well as methods of 

10 making and using such polyamides. In addition, sequences adjacent to or neighboring the 
protein binding site may be targeted, as exemplified by selective inhibition of GCN4 at 
two sites using Arg-Pro-Arg-polyamides. The Arg-Pro-Arg domain appears to adopt a 
stable and defined structure which delivers a neutralizing positive charge to the DNA 
backbone where it competes with GCN4 for contact to the phosphates. The broad 

1 5 targetable sequence repertoire of polyamides, coupled with the ubiquity of backbone 
contacts in protein recognition of DNA, make phosphate neutralization by a positive 
patch a promising approach for inhibition of major groove transcription factors. 

One skilled in the art may utilize the examples provided herein to design, 
polyamides comprising a positive patch sequence. The examples listed above and those 

20 illustrated below represent only certain embodiments of the present invention and are not 
limiting of the specification and claims in any way. 



EXAMPLES 
Example 1 

25 Materials and Methods 

All buffers for gel mobility shift and footprinting experiments were prepared from 
J.T. Baker reagents and 0.2 jxM filtered. EDTA and DTT were obtained from Gibco 
BRL. Poly(dI-dC>poly(dI-dC) was from Pharmacia Biotech. Ficoll (MW 400,000) was 
purchased from Sigma. T4 polynucleotide kinase, £coRI, PvuII, and DNase I were from 
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Boehringer Mannheim. AflSL and Fsp\ were purchased from New England Biolabs. 
Sequenase (version 2.0) was obtained from United States Biochemical. [a- 32 P]- 
Thymidine-5'-triphosphate (> 3000 Ci/mmol), [a- 32 P]-deoxyadenosine-5'-triphosphate 
(>6000 Ci/mmol), and were purchased from Du Pont/NEN. [y- 32 P]-Adenosine-5'- 
5 triphosphate (> 7000 Ci/mmol) was obtained from ICN. GCN4 (222-281) was prepared 
by solid phase synthesis. MBHA resin (0.57 mmol/g) was from Calbiochem. Boc-p-Ala, 
Boc-Y-aminobutyric acid, Boc-(Tos)Arg, Boc-Ala, Boc(CBz)Lys, Boc-Gly, Boc-Pro and 
Boc-D-Pro were from Peptides International. /?-Cresol was purchased from Aldrich. All 
other chemicals, as well as the purification and characterization of polyamides were as 
10 previously described (Baird, et al. (1996) J. Am. Chem. Soc. 118, 6141-6146). 

Example 2 

Arg-Pro-Arg Polyamides 
Schematic models of certain polyamides targeted to the binding site of the bZIP 

15 transcriptional activator GCN4 are illustrated in Figure 1, parts a-d. A series of 
polyamides with Arg-Pro-Arg tripeptides at the C-terminus were synthesized by solid 
phase methods. The polyamides were evaluated as inhibitors of the major groove 
transcription factor GCN4, the prototypical member of the basic region-leucine zipper 
(bZIP) family of transcriptional regulators (Hurst, H. C. (1995) Protein Profile 2, 105- 

20 168; Struhl, K. (1992) Yeast GCN4 transcriptional activator protein. In Transcriptional 
Regulation. (McKnight, S. L. & Yamamoto, K. R., eds), pp. 833-859, Cold Spring 
Harbor Laboratory Press, New York; Cuiran, T.& Vogt, P. (1992) Dangerous liasions: 
fos and jun, oncogenic transcription factors. In Transcriptional Regulation. (McKnight, 
S. L. & Yamamoto, K. R., eds), pp. 797-832, Cold Spring Harbor Laboratory Press, New 

25 York). The C-terminal sixty amino acids (222-281) of GCN4 contain the "leucine 
zipper" dimerization domain and the "basic region" which is responsible for DNA 
binding. GCN4 (222-281) has been shown to be sufficient for sequence specific binding 
(Oakley, et al. (1990) Science 248, 847-850; Hope, et al. (1986) Cell 46, 885-894; 
Paolella et al (1994) Science 264, 1130-1133). The basic region of each a-helical 

30 monomer makes specific hydrogen bonds, van der Waals contacts, and phosphate 
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interactions with one half-site of the nine base pair pseudosymmetrical GCN4 binding 
site (Figure 2 A & B) (Oakley, et al. (1990), supra; Ellenberger, et aL (1992) Cell 71, 
1223-1237; Konig, et ah (1993) J. Mol. Biol. 233, 139-154). The protein-DNA 
electrostatic interactions which are targeted for disruption by the Arg-Pro-Arg- 
5 polyamides are highlighted in Figure 2B. 

Homeodomain proteins recognize the minor groove of DNA via a highly 
conserved Arginine (Arg) - X aa - Arg (X aa = any amino acid) (Gehring, W. J., et al, & 
Wtithrich, K. (1994) Homeodomain-DNA recognition. Cell 78, 21 1-223; Gehring, et al. 

(1994) Homeodomain proteins. Annu. Rev. Biochem. 63, 487-526). In Hin recombinase, 

140 141 142 

10 the corresponding Arg - Proline (Pro) - Arg domain serves as a bridge 

between the N-terminal arm in the minor groove and the helix-turn-helix motif which 

recognizes the major groove (Sluka, et al. (1990) Biochemistry 29, 6551-6561; Feng, et 

al. (1994) Science 263, 348-355). Minor groove contacts made by the side chain of 
140 > • 

Arg direct the peptide chain up from the floor of the minor groove, toward the 

142 

1 5 backbone, where the guanidinium of Arg makes electrostatic contact with phosphates. 

Upon interaction with DNA, the Arg-Pro-Arg domain achieves a stable, local tertiary 

structure which is potentially based solely on the primary sequence. It was postulated that 

Arg-Pro-Arg attached at the C-terminus of a polyamide would adopt a similar structure to 
140 141 142 

that of Arg -Pro -Arg in Hin recombinase. The resulting Arg-Pro-Arg- 
20 polyamide could be used to place a neutralizing positive charge at a predetermined 
phosphate on the DNA backbone (Figure 3). Arg-Pro-Arg-polyamide synthesis is 
exemplified here for polyamide 3. 

ImPyPyPy-y-PyPyPyPy-$-RPR (compound 3) 

Polyamides 1-4 are illustrated in Figure 4A-D. ImPyPyPy-y-PyPyPyPy-p-RPR- 

25 MBHA-resin was synthesized in a stepwise fashion by machine-assisted solid phase 

methods [44] from MBHA resin (600 mg, 0.57 mmol/g, calculated as L n ew(nunol/g) = 

_3 

Lold/(l + Lold(W n ew - W Q ld) x 10 ), where L is the loading (mmol of amine per gram 
of resin) and W is the weight (g mof *) of the growing peptide attached to the resin 
(Barios, et al. (1991) Int. J. Peptide Protein Res. 37, 513-520). A sample of polyamide 
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resin (300 mg, 0.30 mmol/g) was placed in a Kel-F reaction vessel, ^-cresol (1 g) added, 
and the vessel cooled to -60 °C. HF was then condensed into the vessel. The solution 
was stirred vigorously for one hour (0 °C) and the excess HF was removed in vacuo. The 
reaction mixture was then treated with cold ethyl ether (50 mL) and the resulting 
5 resin/polyamide coprecipitate was collected by vacuum filtration. The polyamide was 
then extracted with CH3CN:H20:TFA (10:9:1), 0.1% (w/v) TFA added (6 mL) and the 
resulting solution purified by reverse phase HPLC using a Waters DeltaPak 25 x 100 mm 
100 urn Ci8 column in 0.1% (w/v) TFA, gradient elution 0.25%/min. CH3CN. 
ImPyPyPy-y-PyPyPyPy-p-RPR-NH2 was recovered upon lyophilization of the 

10 appropriate fractions as a white powder (84 mg, 60% recovery); UV (H2O) X maK 244, 
312 (66,000); X H NMR (DMSO-rf 6 ): 6 10.49 (s, 1H), 9.96 (s, 1H), 9.93 (s, 1H), 9.90 (s, 
3H), 9.84 (s, 1H), 8.19 (d, 1H, J=7.6 Hz), 8.07 (m, 1H), 8.02 (m, 1H), 7.95 (m, 1H), 7.91 
(m, 1H), 7.53 (m, 3H), 7.38 (s, 1H), 7.26 (d, 1H, J=1.6 Hz), 7.25 (s, 1H), 7.21 (d, 1H, 
J-1.6 Hz), 7.20 (s, 1H), 7.19 (s, 1H), 7.14 (m, 3H), 7.03 (m, 4H), 6.87 (d, 1H, J=1.6 Hz), 

15 6.85 (m, 2H), 4.48 (t, 1H, J=4.8 Hz), 4.27 (q, 1H, J=4.4 Hz), 4.77 (m, 1H), 3.96 (s, 3H), 
3.81 (m, 12H), 3.80 (s, 3H), 3.77 (s, 3H), 3.76 (s, 3H), 3.60 (m, 2H), 3.32 (q, 2H, J=4.8 
Hz), 3.17 (q, 2H, J=6.1 Hz), 3.0 (m, 4H), 2.36 (t, 2H, J=6.9 Hz), 2.25 (t, 2H, J=6.9 Hz), 
2.0 (m, 2H), 1.77 (m, 4H), 1.64 (m, 2H), 1.46 (m, 6H). 

The polyamides ImPyPyPy-y-PyPyPyPy-p-Dp (1) and ImlmPyPy-y-ImPyPyPy-P- 

20 Dp (2) were synthesized in a stepwise manner from Boc-P-alanine-Pam resin using Boc- 
chemistry machine-assisted protocols as previously described (Baird, et al. (1996) J. Am. 
Chem. Soc. 118, 6141-6146). Polyamides with C-terminal aliphatic amino acids were 
synthesized on MBHA resin from Im and Py monomer units and commercially available 
aliphatic amino acids in 26 steps (Figures 4 and 5). Treatment with HF:p-cresol (9:1) 

25 followed by precipitation with ethyl ether and extraction with 0. 1 % TFA:CH3CN (50:50) 
afforded the deprotected polyamide which was purified by reverse phase HPLC. 
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Example 3 

Selective inhibition of GCN4 (222-281) binding by Arg-Pro-Arg poly amides 

Synthetic DNA fragments were prepared on an ABI 380B Automated DNA 
Synthesizer and purified by preparative denaturing polyacrylamide gel electrophoresis. 
5 ARE-1 (5 ' -CCGG ATCC ATGGTTGCTG ACT AATTGTTATCCTCT AG AGTCG ACC- 

3') and ARE-2 (5' -CCGG ATCC ATGGTTGCTG ACT AATTGGTCTCCTCT AG AGTCG 

32 

ACC-3') were radiolabeled at the 5'-terminus with y- P-ATP and T4 polynucleotide 
kinase, annealed to an equimolar amount of the unlabeled complement, and purified by 
nondenaturing polyacrylamide gel electrophoresis (Sambrook, et al. Molecular Cloning, 

10 (2nd ed.). Cold Spring Harbor Laboratory Press: Cold Spring Harbor, NY). 

For gel mobility shifts, polyamide was incubated with radiolabeled synthetic 
DNA duplex (10 kcpm) in 40 \xL reaction volumes of bisTris (10 mM, pH 7.0), NaCl 
(100 mM), DTT (1 mM), EDTA (1 mM), and poly(dI-dC>poly(dI-dC) (50 ^ig/mL) for 
16 hours at 22 °C (20 mM MOPS, pH 7.0, 140 mM KC1, 10 mM NaCl, ImM MgCl2, 1 

15 mM spermine was used to model ionic conditions in vivo). GCN4 (222-281) was added 
and equilibrated for 30 minutes. Loading buffer (15% Ficoll, 0.025% bromophenol blue) 
(10 \xL) was added and 10 |iL was immediately loaded onto a running 8% (29:1, 
acrylamide:bis-acrylamide) polyacrylamide gel (0.5 X TBE, 280 V, 0.8 mm, 13 cm). 
Sufficient separation of the free DNA and the DNA»GCN4 (222-281) complexes was 

20 achieved within 45 minutes. Gels were dried and exposed to a storage phosphor screen 
(Molecular Dynamics) (Johnston, et al. (1990) Electrophoresis 1 1, 355). 

Synthetic radiolabeled DNA duplexes, ARE-1 and ARE-2, containing a GCN4 
binding site (5 '-CTGACTAAT-3 ') (Oakley, et al. (1990) Science 248, 847-85029; 
Oakley, et al. (1992) Biochemistry 31, 10969-10975), were bound near saturation at 200 

25 nM GCN4 (222-281) as revealed by gel mobility shift analysis (10 mM bisTris pH 7.0, 
100 mM NaCl, 1 mM DTT, 1 mM EDTA, 50 ^g/mL poly(dI-dC)*poly(dI-dC), 22 °C). 
ImPyPyPy-y-PyPyPyPy-p-polyamides (1 and 3) target the six base pair 5'-TGTTAT-3' 
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site adjacent to the GCN4 binding site of ARE-1 (Figure 6). ImlmPyPy-y-ImPyPyPy-P- 
polyamides (2 and 4) were designed to bind 5'-TGGTCT-3' adjacent to the GCN4 site in 
ARE-2 (Figure 6). 

The ability of polyamides to inhibit GCN4 (222-281) binding was evaluated using 
5 the gel mobility shift assay. Increasing concentrations of polyamide were incubated with 
the desired radiolabeled synthetic DNA duplex followed by the addition of a constant 
concentration of 200 nM GCN4 (222-281). DNA fragments bound and unbound by 
GCN4 were separated using nondenaturing polyacrylamide gel electrophoresis. 

Polyamides 1 and 2, which lack the Arg-Pro-Arg moiety were unable to inhibit 
10 GCN4 binding (Figure 6). The upper band in Figure 6A is the ARE-1 fragment bound by 
GCN4 (222-281) (lanes 2-12). Lanes 3-1 1 show that GCN4 binding was unaffected by 
the addition of ImPyPyPy-y-PyPyPyPy-p-Dp. However, ImPyPyPy-y-PyPyPyPy.p-RPR, 
which differs from 1 by the addition of the C-terminal Arg-Pro-Arg, inhibited GCN4 
binding to ARE-1 (Figure 6C). When bound to its match site on ARE-2, ImlmPyPy-y- 
1 5 ImPyPyPy-p-RPR also successfully inhibited GCN4 binding (Figure 6F). 

Based on the pairing rules for polyamide-DNA complexes, the sites 5'-TGTTAT- 
3' (ARE-1 fragment) and 5'-TGGTCT-3' (ARE-2 fragment) are for ImPyPyPy-y- 
PyPyPyPy-RPR "match" and "double-base-pair mismatch" sites, respectively, and for 
ImlmPyPy-y-ImPyPyPy-RPR "double-base pair mismatch" and "match" sites, 
20 respectively. Incubation of GCN4 and up to 2 \xM ImPyPyPy-y-PyPyPyPy-p.RPR w ith 
the double mismatch ARE-2 fragment resulted in no inhibition of GCN4 binding (Figure 
6D). Likewise, ImlmPyPy-y-ImPyPyPy-p-RPR did not inhibit GCN4 binding to the 
mismatched ARE-1 fragment (Figure 6E). 

When bound to their respective match sites, ImPyPyPy-y-PyPyPyPy-p-RPRRRR 
25 (5) and ImlmPyPy-y-ImPyPyPy-p-RPRRRR (6), which contain an additional three C- 
terminal arginine residues relative to 3 and 4, were found to folly inhibit GCN4 (222-281) 
binding (Figures 7 and 8). The gel mobility shift experiments depicted in Figure 6 
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demonstrate that 5 and 6 selectively provided complete inhibition of GCN4 binding with 
no apparent loss in specificity for double-base-pair mismatches. 

Example 4 

5 Design of Optimum Tripeptide for GCN4 Inhibition 

Polyamides with deletions and/or substitutions in the Arg-Pro-Arg domain were 

prepared in order to determine the elements which were essential for GCN4 inhibition 

(Figure 9). Each of these polyamides was based on the ImPyPyPy-y-PyPyPyPy-p 

polyamide targeted to 5'-TGTTAT-3' of ARE-1. The ability of polyamides 7-14 

10 (Figures 9 and 10) to bind their DNA target sites and inhibit GCN4 binding to ARE-1 

was evaluated using DNase I footprinting and gel mobility shift analysis. 

DNase I footprinting of polyamides 1-14 was performed on restriction fragments 

containing the appropriate ARE-1 or ARE-2 sequences under conditions identical to 

those used for the gel mobility shift experiments. In every case, (except 11, see below) 

7 1 

15 the polyamide was found to specifically bind the target site with AT a ~ 1 x 10 M 
(Figure 11). Lower AT a values are observed for polyamides under the gel shift conditions 
due to the carrier DNA which artificially depresses polyamide binding constants. The 
polyamide concentrations required for GCN4 inhibition are within the expected range 
based on the K% under gel shift conditions. 

20 Deletion of the terminal Pro-Arg or Arg, as in ImPyPyPy-y-PyPyPyPy~p-R, (7) 

and ImPyPyPy-y-PyPyPyPy-P-RP (8), results in polyamides which are unable to inhibit 
GCN4 binding. Substituting the proline with glycine afforded ImPyPyPy-y-PyPyPyPy-p- 
RGR (9), which did not effectively inhibit GCN4 (222-281). At 1 of 9, < 50% of the 
GCN4 was inhibited. No inhibition of GCN4 binding was observed for ImPyPyPy-y- 

25 PyPyPyPy-P-R D PR (10) which contained a single inversion of stereochemistry relative 
to 3. 
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The internal arginine was replaced with an alanine residue to provide ImPyPyPy- 
y-PyPyPyPy-P-APR (11). 11 was unable to inhibit GCN4 binding under these 
conditions. This Arg to Ala substitution was the only alteration which was found to 
affect polyamide binding affinity. By DNase I footprinting, 11 binds the 5'-TGTTAT-3' 
5 target site with ten-fold lower affinity than 3 under conditions identical to those used for 
gel shift analysis. The conservative substitution of lysine for arginine in ImPyPyPy-y- 
PyPyPyPy-P-KPR (12) also compromised the polyamide's ability to inhibit GCN4. At 1 
\xM 12, < 50% of the bound GCN4 was inhibited, similar to 9. However, the identical 
substitution in the C-terminal position afforded ImPyPyPy-y-PyPyPyPy-p-RPK (13), 

10 which inhibited GCN4 binding identically to the Arg-Pro-Arg polyamide 3. The amino 
acid linkage between the final Py amino acid and the initial arginine was also crucial for 
GCN4 inhibition. A polyamide in which the p-alanine linker was replaced with a 7- 
aminoheptanoic acid linker, ImPyPyPy-y-PyPyPyPy-C7-RPR (14), was unable to inhibit 
GCN4 binding. Protein inhibition did not require prebinding of polyamide. 

15 Preincubation of ARE-1 with GCN4 followed by addition of 5 afforded inhibition 
identical to that of prebound polyamide. 

By targeting an 8-ring Arg-Pro-Arg-polyamide adjacent to a GCN4 binding site, 
selective inhibition of DNA binding by a protein which exclusively contacts the major 
groove is achieved (Figures 4 and 6). The polyamide domain binds sequence specifically 

20 in the minor groove with double base pair mismatches preventing GCN4 inhibition. 

The inability of truncated analogs 7 (R) or 8 (RP) to inhibit GCN4 binding 
indicates that the C-terminal arginine in 3 (RPR) is crucial for GCN4 inhibition. Based on 
the Hin recombinase model, this arginine is expected to make non-specific contacts to the 
DNA phosphate backbone. The ability of 13 (RPK) to inhibit GCN4 identically to 3 

25 (RPR) supports this model. The neutralization of a portion of the backbone is the most 
likely mechanism by which Arg-Pro-Arg polyamides achieve GCN4 inhibition. Other 
models, such as steric blockage of the major groove or DNA distortion, cannot be ruled 
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out (Strauss, et al. (1994) Science 266, 1829-1834). Modeling suggests that Arg-Pro-Arg 
is insufficient to cross the DNA backbone and block the major groove. Determination of 
the exact mechanism of inhibition awaits high-resolution structure studies which are in 
progress. 

5 The results of GCN4 inhibition experiments with polyamides 7-14 suggest that 

the Arg-Pro-Arg of 3 and 4 adopts a stable and well-defined structure similar to Arg* 4 ^- 
141 142 

Pro -Arg of Hin recombinase. The internal Arg-Pro of 3 and 4 (RPR) is required 
for GCN4 inhibition. Polyamide 8 does not inhibit GCN4, suggesting that these two 
residues play a structural role in the placement of the terminal arginine near the phosphate 

10 backbone. Replacing the rigid proline of 3 (RPR) with a flexible glycine (9) (RGR) 
allows significant amounts of GCN4 to remain bound in the presence of saturating 
concentrations of 9. The glycine in 9 may permit the C-terminal arginine to shift to a 
position which permits simultaneous binding with GCN4. 

ImPyPyPy-y-PyPyPyPy-p-R D PR (10) is a diastereomer of ImPyPyPy-y- 

15 PyPyPyPy-P-RPR (3), but is unable to inhibit GCN4. Modeling indicates that 
substitution of D-proline for L-proline may in fact direct the neutralizing terminal arginine 
to the backbone of the opposite DNA strand. However, confirmation of this prediction 
awaits studies with other protein systems. 

Replacement of the internal arginine with alanine, as in 11, reduces binding 

20 affinity by a factor of ten and prevents GCN4 inhibition. Furthermore, the Lys-Pro-Arg 
polyamide, 12, exhibits a binding affinity comparable to 3, yet it is a less effective 
inhibitor of GCN4. Together, these results suggest that the guanidinium of the internal 
arginine makes specific contacts with the DNA which are required for the proper 
positioning of the remaining residues. Replacement of the P-alanine linker (3) with 7- 

25 aminoheptanoic acid (14) eliminates inhibition, further implicating the placement of the 
Arg-Pro-Arg moiety as a requirement for effective inhibition. 



WO 98/37087 



30 



PCT/US98/02684 



Example S 

Salt Dependence of Positive Patch Polyamides 
In order to evaluate the sensitivity of positive patch mediated major groove 
protein inhibition to the nature of the compensating electrolyte, as well as the overall 
ionic strength, gel mobility shift analysis was performed using a buffer which models the 
environment of the cellular nucleus (20 mM MOPS, pH 7.2, 140 mM KC1, 10 mM NaCl, 
1 mM MgCl2, 1 mM spermine) (Jones, et al. (1993) J. Org. Chem. 58, 2983-2991). Arg- 
Pro-Arg-polyamide 3 was found to inhibit GCN4 (222-281) binding under the in vivo 
ionic conditions which feature KC1 as the primary compensating electrolyte and the 
conditions optimized for protein binding which feature NaCl as the predominant 
compensating electrolyte (Figure 6C). Further biophysical characterization of major 
groove protein inhibition by positive patch polyamides will be reported in due course. 

Example 6 

Arg-Pro-Arg Polyamide Binding Affinity and Specificity 

In order to evaluate the effects of the Arg-Pro-Arg moiety on the DNA binding 

properties of the polyamides, quantitative DNase I footprint titration experiments were 

performed to determine the equilibrium association constants of polyamides 1-6 for their 

respective six base pair match and single-base-pair mismatch sites (10 mM Tris pH 7.0, 

10 mM KC1, 10 mM MgCl2, 5 mM CaCl2, 22 °C) (Brenowitz, et al. (1986) Methods 

Enzymol 130, 132-181). 

The Afl WJFsp I restriction fragment of pJT8 (Trauger, et al. (1996) Nature 382, 
32 

559-561) was 3'- P-end-labeled by digesting the plasmid with AflU and Fspl and 

32 

simultaneously filling in using Sequenase, [a- P]-deoxyadenosine-5 ! -triphosphate, and 
32 

[a- P]-thymidine-5'-triphosphate, and isolating the 229 bp fragment by nondenaturing 
gel electrophoresis. The 250 bp EcoRUPvuU restriction fragment of pJK6 (Struhl, K. 
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(1992) Yeast GCN4 transcriptional activator protein. In Transcriptional Regulation, 
(McKnight, S. L. & Yamamoto, K. R., eds), pp. 833-859, Cold Spring Harbor Laboratory 
Press, New York) was prepared in an analogous manner. A and G sequencing were 
carried out as described (Maxam, et al. (1980) Methods Enzymol. 65, 499-560; Iverson, et 

5 al. (1987). Methods Enzymol 15, 7823-7830). 

All reactions were executed in a total volume of 400 |liL. A polyamide stock 
solution or H2O (for reference lanes) was added to an assay buffer containing 
radiolabeled restriction fragment (20 kcpm), affording final solution conditions of 10 mM 
Tris*HCl (pH 7.0), 10 mM KC1, 10 mM MgCl2, 5 mM CaCl2, and either (i) 0.001 nM- 

10 100 nM polyamide or (ii) no polyamide (for reference lanes). The solutions were allowed 
to equilibrate at 22 °C for 1 8h. Footprinting reactions were initiated by the addition of 4 
\\L of a DNase I stock solution (at the appropriate concentration to give ~ 55% intact 
DNA) containing 1 mM dithiothreitol and allowed to proceed for seven min at 22 °C. 
The reactions were stopped by the addition of 50 \xL of a solution containing 1.25 M 

15 NaCl, 100 mM EDTA, 0.2 mg/mL glycogen, and 28 ^iM base-pair calf thymus DNA, and 
ethanol precipitated. Reactions were resuspended in IX TBE/80% formamide loading 
buffer, denatured by heating at 85 °C for 10 min, and placed on ice. The reaction 
products were separated by electrophoresis on an 8% polyacrylamide gel (5% cross-link, 
7 M urea) in IX TBE at 2000 V for 1.5h. Gels were dried and exposed to a storage 

20 phosphor screen (Molecular Dynamics) (Johnston, et al. (1990) Electrophoresis 1 1 , 355). 

Data from the footprint titration gels were obtained using a Molecular Dynamics 
400S Phosphorlmager followed by quantitation using ImageQuant software (Molecular 
Dynamics). Background-corrected volume integration of rectangles encompassing the 
footprint sites and a reference site at which DNase I reactivity was invariant across the 

25 titration generated values for the site intensities (I^te) * e reference intensity (I re f). 
The apparent fractional occupancy (0 a pp) of the sites were calculated using the 
equation: 
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I site 7 Iref 

where T° s j te and I° re f are the site and reference intensities, respectively, from a control 
lane to which no polyamide was added. The ([L] tot , 9 a pp) data points were fit to a 
Langmuir binding isotherm (eq 2, n=l) by minimizing the difference between 9 ap p and 
5 Gfit, using the modified Hill equation: 

QfA = 6nin + (Bnax - 9min) ^ } — (2) 

1 + Ka n [L] n tot 

where [L] tot is the total polyamide concentration, is the equilibrium association 
constant, and 9 m i n and 9 ma x are the experimentally determined site saturation values 
when the site is unoccupied or saturated, respectively. The data were fit using a nonlinear 
10 least-squares fitting procedure with 9 ma x, and 9 m i n as the adjustable parameters. All 
acceptable fits had a correlation coefficient of R > 0.97. At least three sets of data were 
used in determining each association constant. All lanes from each gel were used unless 
visual inspection revealed a data point to be obviously flawed relative to neighboring 
points. 

15 DNase I footprinting of ImlmPyPy-y-ImPyPyPy-P-Dp (2), ImlmPyPy-y- 

ImPyPyPy-p-RPR (4), and ImlmPyPy-y-ImPyPyPy-P-RPRRRR (6) was performed on 
the 250 bp EcoKUPvull restriction fragment of pJK6 (Kelly, et al. (1993) Proa Natl. 
Acad. ScL f USA 93, 6981-6985). 2 bound the match sites, 5'-TGGTCA-3' and 5'- 

TGGACA-3% with identical affinities within experimental error (AT a = 1.3 (±0.1) x 10 10 

-1 9 1 

20 M and 6.4 (±1.2) x 10 M" , respectively). 2 also demonstrated greater than 100-fold 

specificity for a single base pair mismatch site 5'-TGTACA-3 5 (K a < 5 x 10 7 Mf* 1 , 

mismatched base pair underlined). Similar affinity and a slight increase in specificity 

were observed for Arg-Pro-Arg polyamide 3. 5'-TGGTCA-3' and 5'-TGGACA-3' were 

bound by 3 (AT a = 4.6 (±0.2) x 10 10 M" 1 and 6.6 (±1.0) x 10 10 M" 1 , respectively) with 

25 greater than 450-fold specificity versus the mismatch site (K& = < 1 x 10 7 M" 1 ). The 

additional three terminal arginines of 6 generated a ten-fold increase in affinity relative to 
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2 coupled with a significant, loss in specificity for a single-base-pair mismatch. 6 bound 
5'-TGGTCA-3\ 5'-TGGACA-3\ and 5'-TGTACA-3' with affinities of 2.6 (±0.4) x 
10 10 M" 1 , 2.8 (±0.5) x 10 10 M" 1 and 1.9 (±0.8) x 10 10 M" 1 , respectively. 

Corresponding results were observed for DNase I footprinting of ImPyPyPy-y- 

5 PyPyPyPy-p-Dp (1), ImPyPyPy-y-PyPyPyPy-P-RPR (3) and ImPyPyPy-y-PyPyPyPy-p- 
RPRRRR (5) on the 229 bp Afll\IFsp\ restriction fragment of pJT8 (Trauger, et al. (1996) 
Nature 382, 559-561). 1 has been shown to bind its six base pair match site, 5'- 
AGTTAT-3', with an affinity of 3.5 (±0.8) x 10 9 M" 1 and 7-fold specificity versus a 
single base pair mismatch site 5'-AGTACT-3' (K a = 5.0 (±0.5) x 10 8 M~\ Table 3) 

10 (Trauger, et al. (1 996) Nature 382, 559-561). 



Table 3 

Equilibrium Association Constants (M' 1 ) 



Polyamide 


5'-AGTATT-3' 


5'-AGTACT-3' 


ImPyPyPy-y-PyPyPyPy-p-Dp 


3.5 x 10'* 


5.0xl0 8 * 


ImPyPyPy-Y-PyPyPyPy-P-RPR 


5.5 x 10 8 


9.2 x 10 7 


ImPyPyPy-y-PyPyPyPy-P-RPRRRR 


1.0 x 10'° 


3.4 x 10 9 



* Values reported for the six-base pair match (S'-AGTATT-T) and mismatch (5'- 
AGTACT-3') (mismatch underlined) sites are the mean values obtained from three Dnase 



15 I footprint titration experiments on the Aflll / Fsp I restriction fragment of pJT8. The 
assays were carried out at 22°C at pH 7.0 in the presence of 10 mM Tris»HCl, 10 mM 
KC1, 10 mM MgCl 2 , and 5 mM CaCl 2 . *From Trauger, et al. (1996) Nature 382, 559- 
561. 

The Arg-Pro-Arg polyamide 3 demonstrated only a slight loss in affinity and a 

20 similar specificity (K& = 5.5 (±1.5) x 10 8 M" 1 for 5'-AGTATT-3' and 9.2 (±0.4) x 10 7 

M~* for 5*-AGTACT-3*). Analogous to 6, the additional terminal arginines of 5 

provided a ten-fold increase in affinity for the match site (K& =1.0 (±0.2) x 10 10 M" 1 ) 

9 1 

and a severe loss in specificity for a single-base-pair mismatch (AT a = =3.4(±0.5)x 1 0 M ). 
Quantitative DNase I footprinting demonstrates that the addition of a C-terminal 
25 Arg-Pro-Arg tripeptide as in 3 and 4 does not alter the DNA binding properties of eight- 
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ring hairpin polyamides (1 and 2). However, Arg-Pro-Arg-Arg-Arg-Arg-polyamides, 5 
and 6, have increased binding affinity but no specificity for a single base pair mismatch 
site. DNase I footprinting and gel mobility shift analysis demonstrate that 5 and 6 retain 
their specificity versus double base pair mismatch sites. These results indicate that 

5 synthetic ligands may balance the benefits of additional charge with the consequence of 
lowered sequence specificity (Breslauer, et al. (1988) The origins of the DNA binding 
affinity and specificity of minor groove directed ligands: correlations of thermodynamic 
and structural data. In Structure and Expression (Vol. 2), DNA and Its Drug Complexes 
(Sarma, R. H. & Sarma, M. H. eds), pp. 273-289, Academic Press). For example, a 

10 distamycin analog modified with a decaaza decabutylamine moiety on a pyrrole nitrogen 
interferes with binding of a major groove transcription factor (Bruice, et al. (1997) 
Bioorg. Med. Chem. 5, 685-692). Unfortunately, the sequence specificity of this 
molecule which contains potentially eleven positive charges has not been reported. The 
results described here suggest that such a molecule may bind DNA with reduced 

15 sequence specificity. 

EXAMPLE 7 
Inhibition of DNA-Binding Proteins by Polyamides 
Oligonucleotides were synthesized and purified as previously described (Liberies, 
20 et al. (1996) Proc. Natl. Acad. Sci., USA 93, 9510-4). Polyamides were also 
synthesized as previously described and by Baird, et al. (1996, J. Am. Chem. Soc. 118, 
6141-6). Briefly, synthesis was performed in a stepwise manner from Boc-P-Alanine- 
Pam resin. Polyamides were then cleaved by reaction with ((dimethylamino)propyl)- 
amine and purified by HPLC chromatography. 
25 To prepare labeled DNA, plasmid DNA was digested with Hindlll and EcoRX for 

gel shift analysis, or Pvull and EcoRl for footprinting analysis, and simultaneously 
labeled with Sequenase 2.0, deoxyadenosine 5 ! -(a- 32 P)-triphosphate, thymidine 5 r -(a- 32 P)- 
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triphosphate, and nonradioactive deoxynucleoside triphosphates. The fragment 
waspurified by gel electrophoresis, treated with proteinase K, filtered, further extracted 
with phenol/chloroform, and precipitated with ethanol. 

For DNase I and MPE footprint reactions, all reactions were equlibrated at 22°C, 

5 pH 5.5, in the presence of 45 mM MES, 1 mM MgCl 2 , and labeled DNA for at least 24 
hours. Footprinting reactions were carried out as previously described (Hertzberg, et al. 
(1984) Biochemistry 23, 3934-3945; Brenowitz, et al. (1986) Proc. Natl. Acad. Sci., 
USA 83, 8462-8466.). 

To perform gel shift assays and titrations, all samples were equilibrated as above 

10 for at least 24 hours. One-tenth volume of 15% glycerol loading buffer was added and 
samples were run on a 10% polyacrylamide gel at 40°C, pH 5.5, with a 75:1 
acrylamide/bis-acrylamide ratio in 45 mM Mes and 1 mM MgCl 2 with buffer 
recirculation. Quantitation of isotherms was performed by plotting the Hgand 
concentration against the portion of labeled DNA in the bent conformation and curve fit 

15 using a Langmuir binding isotherm, as previously described (Brenowitz, et al. (1986) 
Proc. Natl. Acad. Sci., USA 83, 8462-8466; Bailly, et al. (1995) J. Molecular Biology 
253, 1-7). 

Two 15 bp purine tracts separated by one turn of the DNA helix (10 bp) were 
targeted by oligonucleotides containing two pyrimidine tracts (T and ? C) connected by a 

20 central T linker of size 2-9. It has been previously shown that such oligonucleotides bend 
DNA to a varying degree dependent upon the size of the linker (Liberies, D. A. & 
Dervan, P. B. (I 996) Proc. Natl. Acad. Sci., USA 93, 9510-4). The intervening 10 bp 
not targeted by the third strand oligonucleotide can be bound by a polyamide specifically 
designed for that sequence, as depicted in Figures 13 and 14. The binding affinity of the 

25 polyamide for its target sequence has previously been determined under similar 
conditions and reported to be 3.7 x 10 10 M" 1 for PA1 and 5.0 x 10 8 M" 1 for PA2, both 
much stronger than the affinity of either the unlinked 15mers for their sites, or the linked 
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bidentate oligonucleotides (Liberies, D. A. & Dervan, P. B. (1996) Proc. Natl. Acad. 
Sci., USA 93, 9510-4); Liberies, D. A. & Dervan, P. B. , unpublished data; Trauger, J. 
W., Baird, E. E. & Dervan, P. B. (1996) Nature 382, 559-61). Given the ability of 
polyamides to straighten DNA, it is demonstrated herein that targeting a polyamide to the 

5 intervening duplex of bent DNA can straighten it, displacing a ligand bound several base 
pairs distally, as depicted in Figure 15. 

PA1 at a concentration of 100 pM can displace a DNA bending third strand 
oligonucleotide at a concentration of 1 nM. This effect is clearly seen in Figure 16 for 
oligonucleotides with linkers of 2 and 3 T residues, bending DNA greater than 600. In 

10 these lanes, the more retarded bent structure is shifted to a less retarded structure with 
mobility similar to a double Y structure, where two oligonucleotides are bound by a 
single target DNA molecule with the polyamide presumably bound between them. For 
oligonucleotides with smaller bend angles, the polyamide does not displace the third 
strand oligonucleotide and the DNA distortion is probably not sufficient to preclude 

15 efficient polyamide binding. The specificity of this effect is seen in Figures 17A and 
17B, where only PA1, but not PA2 can displace oligonucleotide 2 at a concentration of 
100 pM, while neither can displace oligonucleotide 9. 

These experiments were performed with simultaneous addition of polyamide and 
oligonucleotide. Next, we sought to determine if the order of addition was important 

20 given the reported half life of a bound third strand oligonucleotide on DNA at 
approximately 12 hours under near physiological conditions (Maher, et al. (1990) 
Biochemistry 29, 8820-8826). As shown in Figure 18, simultaneous addition, 
preincubation with polyamide for one hour, or preincubation with third strand 
oligonucleotide for 1 hour made little difference in the ability of the polyamide to inhibit 

25 binding of the third strand oligonucleotide, where 1 hour is less than expected association 
rate for both oligonucleotide and polyamide (Maher, L. J., Dervan, P. B. & Wold, B. J. (I 
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990) Biochemistry 29, 8820-8826; Albert, et al. (1997), submitted). As such, polyamides 
may be useful in targeting prebound transcription factors in cells. 

It has been shown that the energy required to bend DNA with bidentate triple 
helical ligands is less than predicted by theoretical models of DNA as a smoothly bending 

5 wormlike chain with coulombic repulsion from phosphates placed at fixed distances 
(Akiyama, T. & Hogan, M. E. (1996) J. Biological Chemistry 271, 29126-29135; 
Liberies, D. A. & Dervan, P. B. , unpublished data; Bloomfield, et al. (1974) in Physical 
Chemistry of Nucleic Acids (Harper and Row, New York), pp. 159-166; Fenley, et al. 
(1992) J Physical Chemistry 96, 3963-3969). However, the prebending of target DNA 

10 for TBP, a general transcription factor, significantly altered its binding affinity (Parvin, et 
al. (1995) Nature 373, 724-727). The effects of prebending on the affinity of the 
polyamide by measuring the Ka, against triplex mediated bending was determined. 
Figure 1 7 shows a sample gel shift titration, where the measured Ka, for PA1 is 6.2 (±0.3) 
x 10 A, °, whereas K, for PA2 is <1 x 10 9 M" 1 . Ka, for PA1 is within experimental error of 

15 Ka measured by DNase I footprinting and shows that the energy for straightening DNA 
bent by a ligand with a lower binding affinity is minimal (Trauger, et al. (1996) Nature 
382, 559-61). Bending effects with TBP may be much larger, given the large number of 
protein-DNA contacts supporting the bent structure (Kim, et al. (1993) Nature 365, 512- 
520; Bond, et al. (1994) Biophysical Journal 67, 825-836). Additionally, TBP-bound 

20 DNA is bent towards the major groove, not the minor groove. 

To confirm that binding of the polyamide is indeed in the intervening duplex, 
DNase I and MPE footprinting were performed, as shown in Figure 17. While no binding 
in the intervening duplex and no displacement of the third strand oligonucleotide are seen 
for PA2, PA1 dislodges the third strand oligonucleotide, while protecting the intervening 

25 duplex. This supports the model of displacement of the bending ligand. 

The mechanism of action of polyamides is assumed to be directly through 
regidification of the double helix. Alternative modes of action to be considered are 
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simple steric blockage, or disruption of the solvation shell or counterion shell. Modeling 
of the triple helix-mediated bend shows a linker that is displaced from the intervening 
duplex to accomodate the bend angle, where shorter linkers are displaced further from the 
duplex than longer linkers. The inability of PA1 to displace bending oligonucleotides 
5 with longer linkers and bend angles less than 600 degrees argues against this explanation. 
Disruption of the salvation shell in the minor groove is likely to be steric and therefore 
unlikely to extend into the triple helical region where no polyamide is bound. 
Furthermore, this minor groove effect is unlikely to effect binding of a third strand in the 
major groove, where simultaneous binding has previously been demonstrated (Parks, et 

10 al. (1996) Bioorganic & Medicinal Chemistry 4, 1045-50). Similarly, the high charge 
density of double helical and triple helical DNA results in a large counterion shell around 
the molecule and in a very short Debye screening length where charge effects are unlikely 
to extend for multiple base pairs (Bond, et al (1994) Biophysical Journal 67, 825-836; 
Philpott, et al. (1995) J. Electrochem. Soc. 142, L25-L28.). 

15 By modifying DNA structure, generalizable sequence-specific polyamides have 

been designed to displace a DNA bending ligand at an adjacent but nonoverlapping 
binding site. This ability to displace DNA bending ligands through rigidification may be 
useful in the design of polyamides as artificial regulators of gene expression, providing a 
potentially valuable tool in molecular biology and human medicine. 

20 

As disclosed herein, the present invention provides the reagents and 
methodologies for the preparation and use of a variety of new polyamides comprising 
positive patch sequences for specific recognition of DNA in the minor groove and 
inhibiting the function of DNA-binding proteins that bind the major groove. Also 
25 provided is a methodology for determining the mechanism of action of inhibition of 
DNA-binding molecules. While a preferred form of the invention has been shown in the 
drawings and described, since variations in the preferred form will be apparent to those 
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skilled in the art, the invention should not be construed as limited to the specific form 
shown and described, but instead is as set forth in the claims. 
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CLAIMS 

We claim: 

1. In a polyamide which specifically binds to base pairs in the minor groove of a 
DNA molecule, the improvement comprising a positive patch consisting of a rigid 
group adjacent to a positively charged group such that a positive charge is 
delivered to the phosphate backbone or the major groove of a DNA molecule. 

2. A polyamide of claim 1 wherein the rigid group comprises a first and a second 
amino acid; said first amino acid being selected from the group consisting of 
arginine, proline, lysine, hydroxyproline and a derivative thereof; and said second 
amino acid being selected from the group consisting of proline, glycine, serine, 
threonine, leucine, isoleucine, valine, alanine, hydroxyproline and a derivative 
thereof 

3. A polyamide of claim 2 wherein said first amino acid is arginine and said second 
amino acid is proline. 

4. A polyamide of claim 1 wherein the positively charged, group comprises a 
synthetic or naturally occurring amino acid having a net positive charge. 

5. A polyamide of claim 1 wherein said positively charged group is selected from 
the group consisting of a primary amino group, secondary amino group, tertiary 
amino group, quartenary amino group, guanidinium group, and an amidinium 
group. 

6. A polyamide of claim 1 wherein said positively charged group is selected from 
the group consisting of arginine, lysine, histidine and a derivative thereof 

7. A polyamide of claim 1 wherein said positively charged group is arginine. 



WO 98/37087 



41 



PCT/US98/02684 



8. A polyamide of claim 1 wherein the positive patch comprises the amino acid 
sequence Arg-Pro-Arg. 

9. A polyamide of claim 1 wherein the polyamide has three or four carboxyamide 
binding pairs. 

10. A polyamide of claim 1 wherein the polyamide comprises an (R)-2,4- 
diaminobutyric acid hairpin turn that facilitates specific binding to base pairs in 
the minor groove of a DNA molecule. 

11. A polyamide of claim 10 wherein the R-2-amino group is derivatized to form an 
acid amide. 

12. A polyamide of claim 1 having the formula: 

X^yX^XeA 

wherein y is -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin linkage derived from y- 
aminobutyric acid or a chiral hairpin linkage derived from R-2,4-diaminobutyric 
acid; 

X,/X 6 , X/Xs, and X 3 /X 4 represent three carboxamide binding pairs which bind 
DNA base pairs and are selected from the group consisting of Hp/Py, Py/Hp, 
Py/Im, Im/Py, and Py/Py to correspond to the DNA base pair in the minor groove 
to be bound; and 

A represents a positive patch consisting of a rigid group adjacent to a positively 
charged group such that a positive charge is delivered to the phosphate backbone 
or major groove of a DNA molecule. 

13. A polyamide of claim 12 wherein the positive patch comprises the amino acid 
sequence Arg-Pro-Arg. 
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14. A polyamide of claim 1 having the formula: 

X 1 X 2 X 3 X 4 yX 5 X 6 X 7 X 8 A 

wherein y is -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin linkage derived from y- 
aminobutyric acid or a chiral hairpin linkage derived from R-2,4-diaminobutyric 
acid; 

X } /X 8 , X 2 /X 7 , X 3 /X 6 , and X 4 /X 5 represent four carboxamide binding pairs which 
bind DNA base pairs and are selected from the group consisting of Hp/Py, Py/Hp, 
Py/Im, Im/Py, and Py/Py to correspond to the DNA base pair in the minor groove 
to be bound; and 

A represents a positive patch consisting of a rigid group adjacent to a positively 
charged group such that a positive charge is delivered to the phosphate backbone 
or major groove of a DNA molecule. 

15. A polyamide of claim 14 wherein the positive patch comprises the amino acid 
sequence Arg-Pro-Arg. 

16. A polyamide of claim 1 having the formula: 

Xj X 2 X3X 4 X5yXgX 7 X g X9X j q A 

wherein y is -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin linkage derived from y- 
aminobutyric acid or a chiral hairpin linkage derived from R-2,4-diaminobutyric 
acid; 

X,/X 10 , X 2 /X 9 , X 3 /X 8 , X 4 /X 7 , X 5 /X 6 represent five carboxamide binding pairs which 
bind DNA base pairs and are selected from the group consisting of Hp/Py, Py/Hp 5 
Py/Im, Im/Py, and Py/Py to correspond to the DNA base pair in the minor groove 
to be bound; and 

A represents a positive patch consisting of a rigid group adjacent to a positively 
charged group such that a positive charge is delivered to the phosphate backbone 
or major groove of a DNA molecule. 
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17. A polyamide of claim 16 wherein the positive patch comprises the amino acid 
sequence Arg-Pro-Arg. 

18. A polyamide of claim 1 having the formula: 

X | X 2 X 3 X 4 X 5 X 6 yX 7 X 8 X 9 X , 0 X, jX , 2 A 
wherein y is -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin linkage derived from y- 
aminobutyric acid or a chiral hairpin linkage derived from R-2,4-diaminobutyric 
acid; 

X,/X, 2 , X 2 /X n , X 3 /X I0 , X/X 9? X 5 /X 8J X/X 7 represent three or four carboxamide 
binding pairs which bind DNA base pairs and are selected from the group 
consisting of Hp/Py, Py/Hp, Py/Im, Im/Py, and Py/Py to correspond to the DNA 
base pair in the minor groove to be bound; and 

A represents a positive patch consisting of a rigid group adjacent to a positively 
charged group such that a positive charge is delivered to the phosphate backbone 
or major groove of a DNA molecule. 

19. A polyamide of claim 18 wherein the positive patch comprises the amino acid 
sequence Arg-Pro-Arg. 

20. A tandem-linked polyamide having the formula: 

x 1 x 2 x 3s ^ x\x\x\ 

y y 

AX,X 5 xf X IJC 6 X' 5 X' 4 / X P 

wherein y is -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin linkage derived from y- 
aminobutyric acid or a chiral hairpin linkage derived from R-2,4-diaminobutyric 
acid; 

X/X* XtTXs, X 3 /X 4 , X'j/X'g, XyX 1 * X\/X\ represent carboxamide binding pairs 
which bind DNA base pairs selected from the group consisting of Hp/Py, Py/Hp, 
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Py/Im, Im/Py, and Py/Py to correspond to the DNA base pair in the minor groove 
to be bound; 

L represents an amino acid linking group selected from the group consisting of P- 
alanine and 5-aminovaleric acid (8); 

P represents a polyamide selected from the group consisting of 
X,X 2 X 3y X 4 X 5 X 6 , X 1 X 2 X 3 X 4y X 5 X 6 X 7 X 8 , X l X 2 X 3 X 4 X s yX fi X 7 X i X 9 X IO , and 
X 1 X 2 X 3 X 4 X 5 X 6 yX 7 X 8 X 9 X, 0 X n X J2 where X,-X 12 are independently selected from 
the group consisting of p-alanine, pyrrole, hydroxypyrrole and imidazole; and 
A represents a positive patch consisting of a rigid group adjacent to a positively 
charged group such that a positive charge is delivered to the phosphate backbone 
or major groove of a DNA molecule. 



21 . A tandem-linked polyamide having the formula: 

X ^2X3X4^ X'jX 2X3X4 

Y ^Y 
AX 8 X 7 X 6 X 5 LX'gX^XgX^ p 

wherein 7 is -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin linkage derived from 7- 
aminobutyric acid or a chiral hairpin linkage derived from R-2,4-diaminobutyric 
acid; 

X,/X 8 , X 2 /X 7 , X 3 /X 6 , X 4 /X 5 , X\/X' 8 , X' 2 /X' 7 , XyX' 6 , and X' 4 /X' 5 represent 
carboxamide binding pairs which bind DNA base pairs selected from the group 
consisting of Hp/Py, Py/Hp, Py/Im, Im/Py, and Py/Py to correspond to the DNA 
base pair in the minor groove to be bound; 

L represents an amino acid linking group selected from the group consisting of p~ 
alanine and 5-aminovaleric acid (8); 

P represents a polyamide selected from the group consisting of 
X]X 2 X 3 7X 4 X 5 X 6 , XjX^X^X^X-jXg, X 1 X 2 X 3 X 4 X 5 7X 6 X 7 X 8 X 9 X 10 , and 
X I X 2 X 3 X 4 X 5 X 6 7X 7 X 8 X 9 X IO X 11 X I2 where X r X l2 are independently selected from 
the group consisting of P-alanine, pyrrole, hydroxypyrrole and imidazole; and 
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A represents a positive patch consisting of a rigid group adjacent to a positively 
charged group such that a positive charge is delivered to the phosphate backbone 
or major groove of a DNA molecule. 

22. A tandem-linked polyamide having the formula: 

3(1X2X3X4X5 X'jX^X^X^X's 

Y Y 
X 10 X 9 X 8 X 7 X 6 LX'jgX^XgX^Xg p 

wherein y is -NH-CH 2 -CH 2 -CH 2 -CONH- hairpin linkage derived from y- 
aminobutyric acid or a chiral hairpin linkage derived from R-2,4-diaminobutyric 
acid; 

X|/X 10 , Xj/X^, X 3 /X 8 , X 4 /X 7 , X 5 /X 6 , X'j/X'jq, X' 2 /X' 9 , X'j/X'g, X' 4 /X' 7 , X'j/X^ 
represent carboxamide binding pairs which bind DNA base pairs selected from 
the group consisting of Hp/Py, Py/Hp, Py/Im, Im/Py, and Py/Py to correspond to 
the DNA base pair in the minor groove to be bound; 

L represents an amino acid linking group selected from the group consisting of (i- 
alanine, 5 -amino valeric acid (5) and a derivative thereof; 
P represents a polyamide selected from the group consisting of 
X l X 2 X 3 yX 4 X 5 X 6 , X,X 2 X 3 X 4 yX 5 X 6 X 7 X8, X,X2X 3 X 4 X 5 yX 6 X 7 X8X 9 X 105 and 
X 1 X 2 X 3 X 4 X 5 X 6 yX 7 X 8 X 9 X 10 X n X 12 where X r X l2 are independently selected from 
the group consisting of p-alanine, pyrrole, hydroxypyrrole and imidazole; and 
A represents a positive patch consisting of a rigid group adjacent to a positively 
charged group such that a positive charge is delivered to the phosphate backbone 
or major groove of a DNA molecule. 
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23. A tandem-linked polyamide of claim 1 having the formula: 



Xj X 2 X 3 X 4 X5X 6 



X» y» V 1 V V 1 V 



Xi 2^1 1 X] oX 9 X 8 X 7 




LXj 2X f ] j X', 0X9X3X7 



1 



P 



wherein y is -NH-CH 2 -CH 2 ~CH 2 -CONH- hairpin linkage derived from y- 
aminobutyric acid or a chiral hairpin linkage derived from R-2,4-diaminobutyric 
acid; 



Xj/X 12 > x 2 /x n , X 3 /X 10 , x 4 /x 9 , x 5 /x g , x 6 /x 7 , x'|/x* 12 , xyx^, x r 3 /x* 10 , x , 4 /x i 9 , 



X's/X'g and X' 6 /X' 7 represent carboxamide binding pairs which bind DNA base 
pairs selected from the group consisting of Hp/Py, Py/Hp, Py/Im, Im/Py, and 
Py/Py to correspond to the DNA base pair in the minor groove to be bound; 
L represents an amino acid linking group selected from the group consisting of p~ 
alanine, 5-aminovaleric acid (5) and a derivative thereof; 



P represents a polyamide selected from the group consisting of 

X 1X2X37X4X5X5, X 1 X 2 X 3 X 4 yX 5 X 6 X 7 X 8 , X 1 X 2 X 3 X 4 X 5 yX 6 X 7 X 8 X 9 X 10 , and 



the group consisting of P-alanine, pyrrole, hydroxypyrrole and imidazole; and 
A represents a positive patch consisting of a rigid group adjacent to a positively 
charged group such that a positive charge is delivered to the phosphate backbone 
or major groove of a DNA molecule. 



X,X 2 X3X 4 X5X 6 yX 7 XgX 9 X l0 X n X l2 where X r X, 2 are independently selected from 
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24. A polyamide of claim 1 selected the group consisting of: 
ImPyPyPy-y-PyPyPyPy-P-RPR; 
ImlmPyPy-y-ImPyPyPy-P-RPR; 
ImPyPyPy-y-PyPyPyPy-p-RPRRRR; 
ImlmPyPy-y-ImPyPyPy-P-RPRRRR; 
ImPyPyPy-y-PyPyPyPy-p-R; 
ImPyPyPy-Y-PyPyPyPy-p-RP; 
ImPyPyPy-y-PyPyPyPy-p-RGR; 
ImPyPyPy-Y-PyPyPyPy-P-R^R; 
ImPyPyPy-Y-PyPyPyPy-P-APR; 
ImPyPyPy-Y-PyPyPyPy-p-KPR; 
ImPyPyPy-Y-PyPyPyPy-P-RPK; 
ImPyPyPy-Y-PyPyPyPy-p-C7-RPR; and 

the pharmaceutically acceptable salts thereof. 

25. A method of inhibiting gene expression comprising contacting a regulatory 
sequence of a gene with a polyamide of claim 1 . 

26. A method of inhibiting gene expression comprising contacting a DNA molecule 
with a polyamide of claim 1 whereby the DNA molecule is conformationally 
constrained. 
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